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(57) Abstract 



A polypeptide suitable for use in vaccination against Streptococcal infections comprises: a) the amino acid sequence of SEQ ID No 
I; b) a variant of (a) which is capable of binding an anti-MtsA antibody; or (c) a fragment of (a) or (b) of at least 6 amino acids in length 
which is capable of binding an anti-MtsA antibody. A vaccine composition comprises: a) the amino acid sequence of SEQ ID No I; b) 
a variant of (a) which is capable of generating an immune response to a Streptococcal; or (c) a ^gment of (a) or (b) of at least 6 amino 
acids in lengdt which is capable of generating an immune response against a Streptococcus. 
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STREPTOCXKXAL ABC TRANSPORTER PROTEIN 

Field of the invention 

This invention relates to a novel protein which forms pan of an ABC 
transporter of S.pyogenes, The invention also relates to a streptococcus vaccine 
composition comprising this protein or fragments thereof. 



Background of the Invention 

Streptococcus pyogenes y group A Streptococcus^ is a common human pathogen 

10 which causes a variety of diseases such as pharingitis, impetigo, scarlatina and 

erysipelas. More severe infections caused by this organism are necrotizing fasciitis 
and streptococcal toxic shock like syndrome. 

The superfamily of ABC (ATP-binding cassette) transporters comprise 
many different systems in procaryotes and eukaryotes. This diverse group of 

15 transporters serve many roles including transport of nutrients, translocation of 

signal molecules and chemotaxis. The general principle of ABC transport includes 
transportation of a ligand through two integral membrane domains forming a 
pore, with accompanying ATP hydrolysis by two nucleotide-binding domains 
associated with the cytoplasmic side of the pore. 

20 In bacteria, the translocation of ligands is preceded by interaction with an 

accessory component, the periplasmic binding protein. This protein binds the 
ligand with higher affinity, and then interacts with the integral membrane 
components by releasing the ligand and allowing subsequent transport. In gram- 
positive bacteria, the binding protein homologue is a lipoprotein attached to the 

25 cell membrane by a NH2-terminal lipid moiety. Little is known about the ABC 
transporters in gram-positive bacteria. In particular, the interaction between the 
lipid protein component and the integral membrane component is unclear. 

A number of recent studies have looked at an ABC transporter family in 
Streptococcus species. Examples are Correia etal Infect. Immun (1996) 64(6) 2114- 

30 2121: Fenno et al MoLMicrobiol. (1995) 15(5) 849-863; Lowe et al Infect, Immun 
(1995) 63(2) 703-706; Kolenbrander et al Infea. Immun (1994) 62(ld) 4469^480 
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and Sampson etalMecc, Immun (1994) 62(1) 319024. 

The genes encoding the ABC transporter form an operon consisting of 
three genes. The putative proteins encoded by these genes are a hydrophobic 
membrane protein, a nucleotide binding protein and a lipid protein. The operon 

5 has been sequenced in a number of important disease causing organisms such as 
S.pneumoniaey Enterococcus faecalis, S.sangnis and S.parasanguis as well as in 
commensal bacteria such as S.gordonii and Streptococcus crista. As well as playing a 
role in transport, the ABC transporter has also been related to bacterial virulence 
and may mediate bacterial coaggregation, adhesion to host cells, saliva pellicle 

10 components and fibrin clots. 

Summary of the invention 

The applicants have now identified a new protein of S^pyogenes comprising 
the lipoprotein of an ABC transporter. This protein is herein referred to as the 
15 SmtA protein or MtsA. The S.pyogenes operon is atypically organised and the 
polycistronic transcription is attenuated, in contrast to the previously described 
systems. The lipoprotein can be solubilized from the bacterial surface by 
proteolytic cleavage which indicates the presence of a flexible hinge region 
between the NHj-terminal lipid moiety and a more compact globular fold. This 
20 lipoprotein and fragments thereof can be used in streptococcal vaccine 
compositions and in particular against S^pyogenes, 

In a first aspect, the invention provides a polypeptide which comprises: 

(a) the amino acid sequence of SEQ ID NO 1, 

(b) a variant of (a) which is capable of binding an anti-McsA antibody, or 

25 (c) a fragment of (a) or (b) of at least 6 amino acids in length which is capable 
of binding to an anti-MtsA antibody. 

In another aspect, the invention relates to a vaccine composition 
comprising a polypeptide which comprises: 
(a) the amino acid sequence of SEQ ID NO 1, 
30 (b) a variant of (a) which is capable of generating an immune response to a 
Streptococcus, or 
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(c) a fragment of (a) or (b) of at least 6 amino acids in length which is capable 
of generating an immune response against a Streptococcus. 
In a further aspect, the invention relates to novel polynucleotides having a 

sequence which is: 

5 (i) the nucleotide coding sequence of SEQ ID NO 1 or a sequence 
complementary thereto, 
(ii) a nucleotide sequence which selectively hybridises to a said sequence (i) or 
fragment thereof, or 

(ill) a nucleotide sequence which codes for a polypeptide having the same amino 
10 acid sequence as that encoded by a said sequence of (i) or (ii). 

Polynucleotides are therefore provided which encode a polypeptide.of the 
invention. The invention also provides: 

a recombinant vector comprising a polynucleotide of the invention, 
such as an expression vector in which the polynucleotide is operably linked to a 
1 5 regulatory sequence; 

a host cell which is transformed with a polynucleotide of the 

invention; 

a process of producing a polypeptide of the invention comprising 
maintaining a host cell transformed with a polynucleotide of the invention under 
20 conditions to provide expression of the polypeptide; 

an antibody, monoclonal or polyclonal, specific for a polypeptide as 
defined in claim 1; and 

a method of vaccinating a patient against a Streptococcal infeaion, 
which method comprises administering to the patient an effective amount of a 
25 polypeptide according to the invention. 



Description of the figures 



30 



Fig 1. Organization of the S.pyogenes Iral operon. The three genes in the 
S'pyogenes /r/a/ operon are, from the left (5*), mtsA (lipoprotein), mtsB (ATP- 
binding protein), mtsC (integral meimbrane protein). Arrows indicate the location 
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of primers used to verify the operon organizatioiit and to create probes for each of 
the three genes. In the 66 bp non-coding region between mtsA and mtsB a putative 
stem-loop structure is present. 

Fig 2. Proton-induced X-ray emission analysis (PIXE). PIXE analysis of GST, 
GST:MtsA, MtsA and (iron-saturated) transferrin in millipore water. Results are 
based on two different experiments and shown in mol element per mol protein. 

Fig 3, Binding of "Zn to MtsA. A. MtsA and GST were separately incubated 
with ^^Zn. The mixtures were subjected to gel filtration on a PD-10 column and 
500 fA fractions were collected. Fractions were assayed for protein content and 
radioactivity. Co-migration of ^^Zn and protein was seen in the MtsA sample (top 
graph) but not in the GST samples (bottom graph). 

Fig 4. Plots the results of sheep anti-MtsA antisera ELISA against plates coated 
with MtsA peptides. 

Fig 5, Plots the results of sheep anti-MtsA peptide antisera ELISA against plates 
coated with MtsA protein. 

Fig 6. Plots the results of MtsA peptide blocking of sheep anti-MtsA peptide 
antisera ELISA against plates coated with MtsA protein. 

Description of the sequences 

SEQ ID NO 1 sets out the amino acid sequence for the full length MtsA 
polypeptide of S.pyogenes and the nucleotide sequence encoding this protein. The 
structure of MtsA is discussed in more detail below. 

SEQ ID NO 2 sets out the amino acid sequence for full length MtsA of 
S,pyogenes. 

SEQ ID NO 3 and SEQ ID No 4 are examples of primers which may be 
used in the cloning of polynucleotides encoding MtsA. 
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SEQ ID NOs 5 to 9 are examples of MtsA peptides which may be used to 
generate anti-MtsA antisera. 

Detailed description of the invention 

The invention provides a polypeptide consisting essentially of (a) the amino 
acid sequence of SEQ ID NO 1; (b) a variant of the amino acid sequence of SEQ 
ID NO 1; or (c) a fragment of at least 6 amino acids in length of (a) or (b). 
Typically the polypeptide is capable of binding an anti-MtsA antibody. 

Antibody to MtsA can be raised against purified MtsA protein using 
protein purified direaly from S.pyogenes expressing this protein as described in 
more detail below. Alternatively, protein can be generated recombinantly. 
Following purification of the protein, antibody can be raised in an animal such as a 
rabbit and purified to generate the desired antibody. The antibody can be 
monoclonal or polyclonal antibody. Preferably, the polypeptide of the invention 
is incorporated in a vaccine composition for immunisation against a Streptococcal 
infection. Preferably, the antibody is neutraUsing antibody. Preferably, a 
polypeptide of the invention generates anti-MtsA antibody when administered in 
vivo and provides protection against subsequent Streptococcal infection. 
Preferably, a polypeptide of the invention provides proteaion s^ainst group A 
Streptococcus but may also be used to provide vaccines against other Streptococcus 
such as S.pnuemoniae. 

Polypeptides for incorporation into the vaccine compositions in accordance 
with the invention can be identified by determining whether they bind to an 
antibody specific for MtsA. Alternatively, antibody to a candidate polypeptide 
can be generated by standard techniques, for example by injection of the 
polypeptide into an appropriate animal and subsequent collection and purification 
of antisera from such animals. Antibody which binds MtsA can then be identified 
by standard and competitive immunoassays. The antibody thus identified can 
then be injected into mice to determine if it protects against a lethal challenge with 
a Streptococcal strain, Alternativelyj an otherwise lethal dose of a strain of 
Streptococcus is given in an animal model system in which the animals have been 
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given the relevant polypeptide. 

As noted above a variant polypeptide (b) is one which will bind with an 
anti'MtsA antibody. Alternatively, a variant for incorporation into a vaccine 
composition is one which can be used to generate an immune response to provide 
5 protection against a Streptococcal infection. A variant of SEQ ID No 2 may be a 
naturally occurring variant which is expressed by another strain of S.pyogenes. 
Such variants may be identified by looking for a metal transporter protein in these 
strains which has a sequence which is highly conserved compared to SEQ ID No 
2. Such proteins may be identified by analysis of the polynucleotide encoding such 

10 a protein isolated from alternative strains of S.pyogenes for example by carrying out 
the polymerase chain reaction using primers derived from portions of SEQ ID No 
1. The Examples below demonstrate identification of MtsA derived from a 
number of S.pyogmes strains. Primers hybridizing to portions of the DNA 
sequence of SEQ ID No 1 such as those of SEQ ID No 3 or 4 can be used in the 

15 cloning and sequencing of MtsA from other S.pyogenes strains. MtsA antisera 

generated, for example, using the peptide of SEQ ID No 8 may be used to identify 
MtsA expressed by other S.pyogmes strains. 

Variants of SEQ ID No 1 include sequences which vary from SEQ ED No 
1 but are not necessarily naturally occurring MtsA, Over the entire length of the 

20 amino acid sequence of SEQ ID NO 1, a variant will preferably be at least 80% 
homologous to that sequence based on amino acid identity. More preferably, the 
polypeptide is at least 85% or 90% and more preferably at least 95%, 97% or 99% 
homologous to the amino acid sequence of SEQ ID NO 1 over the entire 
sequence. There may be at least 80%, for example at least 85%, 90% or 95%, 

25 amino acid identity over a stretch of 40 or more, for example 60, 100 or 120 or 
more, contiguous amino acids ("hard homology"). 

Amino acid substitutions may be made to the amino acid sequence of SEQ 
ID NO 1, for example from 1, 2 or 3 to 10, 20 or 30 substitutions. Conservative 
substitutions may be made, for example, according to the following table. Amino 

30 acids in the same block in the second column and preferably in the same line in the 
third column may be substituted for each other: 
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One or more amino acid residues of the amino acid sequence of SEQ ID 
5 NO 1 may alternatively or additionally be deleted. From 1, 2 or 3 to 10, 20 or 30 
residues may be deleted, or more. Polypeptides of the invention also include 
fragments (c) of the above-mentioned sequences. Such fragments retain the ability 
to bind anti-MtsA antibody. Fragments may be at least from 10, 12, 15 or 20 to 
60, 100 or 200 amino acids in length. Particularly preferred fragments comprise; 
10 - the sequence from amino acid residue number 136 through to 

residue 152 of SEQ ID NO 1, having the sequence KQLIAKDPKNKETYEKN 
(SEQ ID No 5); 

the sequence commencing at position 204 through to 222 of SEQ ID 
NO 1, having the sequence EINTEEEGTPDQISSLIEK (SEQ ID No 6); 
15 - the sequence commencing at position 234 of SEQ ID NO 1 through 

to position 249 having the sequence ESSVDRRPMETVSKDS (SEQ ID No 7); 

the sequence commencing at position 259 of SEQ ID NO 1 through 
to position 279, having the sequence TDSIAKKGKPGDSYYAMMKWN (SEQ 
ID No 8); and 
20 - variants of these sequences. 

One or more amino acids may be alternatively or additionally added to the 
polypeptides described above. An extension may be provided at the N-terminus 
or C-terminus of the amino acid sequence of SEQ ID NO 1 or polypeptide variant 
or fragment thereof. The or each extension may be quite short, for example from 
25 1 to 10 amino acids in length. Alternatively, the extension may be longer. A 
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carrier protein may be fused to an amino acid sequence according to the invention. 
A fusion protein incorporating the polypeptides described above can thus be 
provided. 

Polypeptides of the invention may be in a substantially isolated form. It 
5 will be understood that the polypeptide may be mixed with carriers or diluents 

which will not interfere with the intended purpose of the polypeptide and still be 
r^arded as substantially isolated. A polypeptide of the invention may also be in a 
substantially purified form, in which case it will generally comprise the 
polypeptide in a preparation in which more than 90%, e.g. 95%, 98% or 99%, by 
10 weight of the polypeptide in the preparation is a polypeptide of the invention. 

Polypeptides of the invention may be modified for example by the addition 
of histidine residues to assist their identification or purification or by the addition 
of a signal sequence to promote their secretion from a cell where the polypeptide 
does not naturally contain such a sequence. It may be desirable to provide the 
15 polypeptides in a form suitable for attachment to a solid support. For example the 
polypeptides of the invention may be modified by the addition of a cysteine 
residue. 

A polypeptide of the invention above may be labelled with a revealing 
label. The revealing label may be any suitable label which allows the polypeptide 

20 to be detected. Suitable labels include radioisotopes, e.g. *^I, ^''S, enzymes, 

antibodies, polynucleotides and linkers such as biotin. Labelled polypeptides of 
the invention may be used in diagnostic procedures such as immunoassays in order 
to determine the amount of a polypeptide of the invention in a sample. 

Polypeptides or labelled polypeptides of the invention may be used in 

25 serological or cell mediated immune assays for the detection of immune reactivity 
to said polypeptides in animals and humans. Standard protocols can be used. The 
labelled polypeptide may be used to identify and/ or isolate "accessory" proteins 
which are involved in binding between cell receptors and MtsA, by detecting the 
interaction of MtsA to such proteins. 

30 A polypeptide or labelled polypeptide of the invention or fragment thereof 

may also be fixed to a solid phase, for example the surface of an immunoassay well 
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or dipstick. 

Such labelled and/or immobilized polypeptides may be packaged into kits 
in a suitable container optionally including additional suitable reagents, controls or 
instructions and the like. The kits may be used to identify MtsA inhibitors or 
5 activators. 

Such polypeptides and kits may also be used in methods of detection of 
antibodies to the MtsA protein by immunoassay. 

Immunoassay methods are well known in the art and will generally 
comprise: 

1 0 (a) providing a polypeptide comprising an epitope bindable by an antibody 
against said protein; 

(b) incubating a biological sample with said polypeptide imder conditions 
which allow for the formation of an antibody-antigen complex; and 

(c) determining whether antibody-antigen complex comprising said 
1 5 polypeptide is formed. 

The proteins of the present invention may be isolated from S.pyogenes 
expressing the protein. Proteins and peptides of the invention may be prepared as 
fragments of such isolated proteins. The proteins and peptides of the invention 
may also be made synthetically or by recombinant means. The amino acid 

20 sequence of proteins and polypeptides of the invention may be modified to include 
non-naturally occurring amino acids or to increase the stability of the compound. 
When the proteins or peptides are produced by synthetic means, such amino acids 
may be introduced during production. The proteins or peptides may also be 
modified following either synthetic or recombinant production. 

25 The proteins or peptides of the invention may also be produced using D- 

amino acids. In such cases the amino acids will be linked in reverse sequence in the 
C to N orientation. This is conventional in the art for producing such proteins or 
peptides. 

A number of side chain modifications are known in the art and may be 
30 made to the side chains of the proteins or peptides of the present invention. Such 
modifications include^ for example, modifications of amino acids by reduaive 
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alkylation by reaction with an aldehyde followed by reduction with NaBH^, 
amidination with methylacetimidate or acylation with acetic anhydride. 

The polypeptides of the invention may be introduced into a cell by in situ 
expression of the polypeptide from a recombinant expression vector. The 
5 expression vector optionally carries an inducible promoter to control the 
expression of the polypeptide. 

Such cell culture systems in which polypeptides of the invention are 
expressed may be used in assay systems. 

A polypeptide of the invention can be produced in large scale following 
10 purification by high pressure liquid chromatography (HPLC) or other techm'ques 
after recombinant expression as described below. 

A polynucleotide of the invention typically is a contiguous sequence of 
nucleotides which is capable of hybridising selectively with the coding sequence of 
SEQ ID NO 1 (nucleotides 1 to 861) or to the sequence complementary to that 
15 coding sequence. Polynucleotides of the invention include variants of the coding 
sequence of SEQ ID NO 1 which encode the amino acid sequence of SEQ ED NO 
1 and variants and fragments of that sequence which are recognized by antibody to 
MtsA. 

A polynucleotide of the invention and the coding sequence of SEQ ID NO 
20 1 can hybridize at a level significantly above background. Background 

hybridization may occur, for example, because of other cDN As present in a 
cDNA library- The signal level generated by the interaction between a 
polynucleotide of the invention and the coding sequence of SEQ ID NO 1 is 
typically at least 10 fold, preferably at least 100 fold, as intense as interactions 
25 between other polynucleotides and the coding sequence of SEQ ID NO 1. The 

intensity of interaction may be measured, for example, by radiolabelling the probe, 
e.g. with ^'P. Selective hybridization is typically achieved using conditions of 
medium to high stringency (for example 0.03M sodium chloride and 0,03M 
sodium citrate at from about 50^C to about 60**C). 
30 A nucleotide sequence capable of selectively hybridizing to the DNA 

coding sequence of SEQ ID NO 1 or to the sequence complementary to that 
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coding sequence will be generally at least 80%, preferably at least 90% and more 
preferably at least 95%, homologous to the coding sequence of SEQ ID NO 1 or 
its complement over a region of at least 20, preferably at least 30, for instance at 
least 40, 60 or 100 or more contiguous nucleotides or, indeed, over the full length 
5 of the coding sequence. Thus there may be at least 85%, at least 90% or at least 
95% nucleotide identity over such regions. 

Any combination of the above mentioned degrees of homology and 
minimum size may be used to define polynucleotides of the invention, with the 
more stringent combinations (i.e. higher homology over longer lengths) being 

10 preferred. Thus for example a polynucleotide which is at least 85% homologous 
over 25, preferably over 30, nucleotides forms one aspect of the invention, as does 
a polynucleotide which is at least 90% homologous over 40 nucleotides. 

For example the UWGCG Package provides the BESTFTT program which 
can be used to calculate homology (for example used on its default settings). 

15 (Devereux etal (1984) Nucleic Acids Research 12, p387-395). The PILEUP 

and BLAST algorithms can be used to calculate homology or line up sequences 
(such as identifying equivalent or corresponding sequences (typically on their 
default settings), for example as described in Altschul S, F, (1993) J Mol Evol 
36:290-300; Altschul, %¥etd (1990) J Mol Biol 215:403-10. 

20 Software for performing BLAST analyses is publicly available through the 

National Center for Biotechnology Information (http://www.ncbi.nlm.nIh,gov/). 
This algorithm involves first identifying high scoring sequence pair (HSPs) by 
identifying short words of length W in the query sequence that either match or 
satisfy some positive^valued threshold score T when aligned with a word of the 

25 same length in a database sequence. T is referred to as the neighbourhood word 
score threshold (Altschul et al, supra). These initial neighbourhood word hits act 
as seeds for initiating searches to find HSP's containing them. The word hits are 
extended in both directions along each sequence for as far as the cumulative 
alignment score can be increased. Extensions for the word hits in each direction 

30 are halted when: the cumulative alignment score falls off by the quantity X from 
its maximum achieved value; the cumulative score goes to zero or below, due to 
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the accumulation of one or more negative-scoring residue alignments; or the end of 
either sequence is reached. The BLAST algorithm parameters W, T and X 
determine the sensitivity and speed of the alignment. The BLAST program uses as 
defaults a word length (W) of 11, the BLOSUM62 scoring matrix (see Henikoff 
and Henikoff (1992) Pn>c. NatL Acad. Sci USA 89: 10915-10919) alignments (B) 
of 50, expeaation (E) of 10, M=5, N=4, and a comparison of both strands. 

The BLAST algorithm performs a statistical analysis of the similarity 
between two sequences; see e.g., Karlin and Altschul (1993) Proc, NatL Acad, Sci. 
USA 90: 5873-5787. One measure of similarity provided by the BLAST algorithm 
is the smallest sum probability (P(N))j which provides an indication of the 
probability by which a match between two nucleotide or amino acid sequences 
would occur by chance. For example, a sequence is considered similar to another 
sequence if the smallest sum probability in comparison of the first sequence to the 
second sequence is less than about 1, preferably less than about 0.1, more 
preferably less than about 0.01, and most preferably less than about 0,001. 

Polynucleotides of the invention may comprise DNA or RNA. They may 
also be polynucleotides which include within them synthetic or modified 
nucleotides. A number of different types of modification to polynucleotides are 
known in the art. These include methylphosphate and phosphorothioate 
backbones, addition of acridine or polylysine chains at the 3' and/or 5' ends of the 
molecule. For the purposes of the present invention, it is to be understood that 
the polynucleotides described herein may be modified by any method available in 
the art. 

Polynucleotides of the invention may be used to produce a primer, e.g a 
PGR primer, a primer for an alternative amplification reaction, a probe e.g. 
labelled with a revealing label by conventional means using radioactive or non- 
radioactive labels, or the polynucleotides may be cloned into vectors. Such 
primers, probes and other fragments will be at least 15, preferably at least 20, for 
example at least 25, 30 or 40 nucleotides in length, and are also encompassed by the 
term polynucleotides of the invention as used herein, SEQ ID Nos 3 and 4 are 
examples of primers of the invention. 
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Polynucleotides such as a DNA polynucleotide and primers according to 
the invention may be produced recombinantly, synthetically, or by any means 
available to those of skill in the art. They may also be cloned by standard 
techniques. The polynucleotides are typically provided in isolated and/ or purified 
5 form, 

In general, primers will be produced by synthetic means, involving a step 
wise manufacture of the desired nucleic acid sequence one nucleotide at a time. 
Techniques for accomplishing this using automated techniques are readily available 
in the art. 

10 Longer polynucleotides will generally be produced using recombinant 

means, for example using PGR (polymerase chain reaction) cloning techniques. 
This will involve making a pair of primers (e.g. of about 15-30 nucleotides) to a 
region of the mtsA gene which it is desired to clone, bringing the primers into 
contact with DNA obtained from a bacterial cell, performing a polymerase chain 

15 reaction under conditions which bring about amplification of the desired region, 
isolating the amplified fragment (e.g. by purifying the reaction mixture on an 
^arose gel) and recovering the amplified DNA. The primers may be designed to 
contain suitable restriction enzyme recognition sites so that the amplified DNA 
can be cloned into a suitable cloning vector. 

20 Such techniques may be xised to obtain all or part of the mtsA gene 

sequence described herein. Genomic clones containing the mtsA gene and its 
promoter region may also be obtained in an analogous manner, starting with 
genomic DNA from a bacterial cell. 

Although in general the techniques mentioned herein are well known in 

25 the art, reference may be made in particular to Sambrook et aly 1989. 

Polynucleotides or primers of the invention may carry a revealing label. 
Suitable labels include radioisotopes such as '"P or "S, enzyme labels, or other 
protein labels such as biotin. Such labels may be added to polynucleotides or 
primers of the invention and may be deteaed using techniques known perse. 

30 Polynucleotides or primers of the invention or fragments thereof, labelled 

or unlabelled, may be used by a person skilled in the art in nucleic acid-based tests 
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for detcCTing or sequencing MtsA in a sample. 

Such tests for detecting generally comprise bringing a sample containing 
DNA or RNA into contact with a probe comprising a polynucleotide or primer of 
the invention under hybridizing conditions and detecting any duplex formed 
5 between the probe and nucleic acid in the sample. Such deteaion may be achieved 
using techniques such as PGR or by immobilizing the probe on a solid support, 
removing nucleic acid in the sample which is not hybridized to the probe, and 
then detecting nucleic acid which has hybridized to the probe. Alternatively, the 
sample nucleic acid may be immobilized on a solid support, and the amount of 

10 probe bound to such a support can be detected. 

The probes of the invention may conveniently be packaged Ln the form of a 
test kit Ln a suitable container. In such kits the probe may be bound to a solid 
support where the assay formats for which the kit is designed requires such 
binding. The kit may also contain suitable reagents for treating the sample to be 

15 probed, hybridizing the probe to nucleic acid in the sample, control reagents, 
instructions, and the like. 

Polynucleotides of the invention can be incorporated into a recombinant 
replicable vector. The vector may be used to replicate the nucleic acid in a 
compatible host celL Thus in a further embodiment, the invention provides a 

20 method of making polynucleotides of the invention by introducing a 

polynucleotide of the invention into a replicable vector, introducing the vector 
into a compatible host cell, and growing the host cell under conditions which 
bring about replication of the vector. The vector may be recovered from the host 
cell. Suitable host cells are described below in connection with expression vectors. 

25 Preferably, a polynucleotide of the invention in a vector is operably linked 

to a control sequence which is capable of providii^ for the expression of the 
coding sequence by the host cell, i.e. the vector is an expression vector. Such 
expression veaors can be used to express the polypeptide of the invention. 
The term "operably linked" refers to a juxtaposition wherein the 

30 components described are in a relationship permitting them to function in their 
intended manner. A control sequence ''operably linked" to a coding sequence is 
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ligated in such a way that expression of the coding sequence is achieved under 
conditions compatible with the control sequences. 

Such vectors may be transformed into a suitable host cell to provide for 
expression of a polypeptide of the invention. Thus, a polypeptide according to the 
invention can be obtained by cultivating a host cell transformed or transfeaed 
with an expression vector as described above under conditions to provide for 
expression of the polypeptide, and recovering the expressed polypeptide. 

The vectors may be for example, plasmid, virus or phage vectors provided 
with an origin of replication, optionally a promoter for the expression of the said 
polynucleotide and optionally a regulator of the promoter. The vectors may 
contain one or more selectable marker genes, for example an ampicillin resistance 
gene in the case of a baaerial plasmid. Promoters and other expression regulation 
signals may be selected to be compatible with the host cell for which the 
expression veaor is designed. 

Host cells transformed (or transfected) with the polynucleotides or veaors 
for the replication and expression of polynucleotides of the invention will be 
chosen to be compatible with the said veaor and preferably will be bacterial such 
as Exoli. Alternatively they may be cells of a human or animal cell line such as 
CHO or COS cells, or yeast or insect cells. The cells may also be cells of a non- 
human animal such as a sheep or rabbit or plant cells. 

The polypeptides of the invention are useful for vaccinating against a 
Streptococcal infection, for example against group A Streptococcus or 
S.pneumoniae. A vaccine of the invention comprises a suitable polypeptide and a 
pharmaceutically acceptable carrier or diluent. The preparation of vaccines which 
contain an immunogenic poIypeptide(s) as aaive ingredient(s), is known to one 
skilled in the art. Typically, such vaccines arc prepared as injectables, either as 
liquid solutions or suspensions; solid forms suitable for solution in, or suspension 
in, liquid prior to injection may also be prepared. The preparation may also be 
emulsified, or the protein encapsulated in liposomes. The active immunogenic 
ingredients are often mixed with exdpients which are pharmaceutically acceptable 
and compatible with the aaive ingredient. Suitable excipients are, for example, 
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water, saline, dextrose, glycerol, ethanol, or the like and combinations thereof. In 
addition, if desired, the vaccine may contain minor amounts of auxiliary 
substances such as wetting or emulsifying agents, pH buffering agents, and/ or 
adjuvants which enhance the effeaiveness of the vaccine. 
5 Examples of adjuvants which may be effective include but are not limited 

to: aluminium hydroxide, N-acetyl-muramyl-L-threonyl-D-isoglutamin (thr- 
MDP), N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 11637, referred to as 
nor-MDP), N-acetylmuramyl'L-alanyl-D-isoglutamnyl-L-alanine-2-(r-2'- 
dipalmitoyl-sn-glycero-3-hydroxyphosphoryloxy)-ethylamine (CGP 19835A, 

1 0 referred to as MTP-PE), and RIBI, which contains three components extracted 
from baaeria, monophosphoryl lipid A, trehalose dimycolate and cell wall 
skeleton (MPL+TDM+CWS) in a 2% squalene/Tween 80 emulsion. The 
effectiveness of an adjuvant may be determined by measuring the amount of 
antibodies directed against an immunogenic polypeptide containing MtsA 

1 5 antigenic sequence resulting from administration of this polypeptide in vaccines 
which are also comprised of the various adjuvants. 

The vaccines are conventionally administered parentally, by injection, for 
example, either subcutaneously or intramuscularly. Additional formulations 
which are suitable for other modes of administration include suppositories and, in 

20 some cases, oral formulations. For suppositories, traditional binders and carriers 
may include, for example, polyalkylene glycols or triglycerides; such suppositories 
may be formed from mixtures containing the active ingredient in the range of 
0.5% to 10%, preferably 1% to 2%, Oral formulations include such normally 
employed excipients as, for example, pharmaceutical grades of mannitol, lactose, 

25 starch, magnesium stearate, sodium saccarine, cellulose, magnesium carbonate, and 
the like. These compositions take the form of solutions, suspensions, tablets, pills, 
capsules, sustained release formulations or powders and contain 10% to 95% of 
active ingredient, preferably 25% to 70%. Where the vaccine composition is 
lyophilised, the lyophilised material may be reconstituted prior to administration, 

30 e.g. as a suspension. Reconstitution is preferably effected in buffer. 

Capsules, tablets and pills for oral administration to a patient may be 
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provided with an enteric coating comprising, for example, Eudragit "S", Eudragit 
*'L", cellulose acetate, cellulose acetate phthalate or hydroxypropylmethyl 
cellulose. 

The polypeptides of the invention may be formulated into the vaccine as 
5 neutral or salt forms. Pharmaccuticaily acceptable salts include the acid addition 
salts (formed with free amino groups of the peptide) and which are formed with 
inorganic acids such as, for example, hydrochloric or phosphoric acids, or such 
organic acids such as acetic, oxalic, tartaric and maleic. Salts formed with the free 
carboxyl groups may also be derived from inorganic bases such as, for example, 
10 sodium, potassium, anmionium, calcium, or ferric hydroxides, and such organic 
bases as isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine and 
procaine. 

The vaccines are administrated in a maimer compatible with the dosage 
formulation and in such amount as will be prophylaaically effective. The 

1 5 quantity to be administered, which is generally in the range of 5.ag to lOOmg, 

preferably 250/^g to lOmg, of polypeptide (antigen) per dose, depends on a number 
of factors. These include the subject to be treated, capacity of the subject*s 
immune system to synthesize antibodies, and the degree of protection desired. 
Precise amounts of aaive ingredient required to be administered may depend on 

20 the judgement of the practitioner and may be peculiar to each subject. 

The vaccine may be given in a singe dose schedule, or preferably in a 
multiple dose schedule. A multiple dose schedule is one in which a primary course 
of vaccination may be with 1-10 separate doses, followed by other doses given at 
subsequent time intervals required to maintain and or reinforce the immune 

25 response, for example at 1 to 4 months for a second dose, and if needed, a 

subsequent dose(s) after several months. The dosage regimen will also, at least in 
part, be determined by the need of the individual and be dependent upon the 
judgement of the practitioner. 

The nucleotide sequences of the invention and expression vectors can also 

30 be used as vaccine formtdations as outlined above. The vaccines may comprise 

naked nucleotide sequences or be in combination with cationic lipids, polymers or 
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targeting systems. The vaccines may be delivered by any technique suitable for 
delivery of nucleic acid vaccines. 

The inununogenic polypeptides prepared as described above can be used to 
produce antibodies, both polyclonal and monoclonal- If polyclonal antibodies are 
desired, a selected mammal (e.g., mouse, rabbit, goat, horse, etc.) is immunised 
with an immunogenic polypeptide of the invention. Serum from the immunised 
animal is collected and treated according to known procedures. If serum 
containing polyclonal antibodies to the polypeptide contains antibodies to other 
antigens, the polyclonal antibodies can be purified by immunoaffinity 
chromatography. Techniques for producing and processing polyclonal antisera are 
known in the art. 

Monoclonal antibodies directed against Streptococcal epitopes in the 
polypeptides of the invention can also be readily produced by one skilled in the 
art. The general methodology for making monoclonal antibodies by hybridomas 
is well known. Immortal antibody-producing cell lines can be created by cell 
fusion, and also by other techniques such as direct transformation of B 
lymphocytes with oncogenic DN A, or transfection with Epstein-Barr virus. 
Panels of monoclonal antibodies produced against polypeptides of the invention 
can be screened for various properties; i.e., for isotype and epitope affinity. 
Preferably the antibody is specific for a MtsA protein epitope. 

Antibodies, both monoclonal and polyclonal, which are directed against 
polypeptides of the invention are particularly useful in di^nosis, and those which 
are neutralising are useful in passive immunotherapy. Monoclonal antibodies, in 
particular, may be used to raise anti-idiotype antibodies. Anti-idiotype antibodies 
are immunoglobulins which carry an "internal image" of the antigen of the 
infectious agent against which protection is desired. 

Techniques for raising anti-idiotype antibodies are known in the art. 
These anti-idiotype antibodies may also be useful for treatment of Streptococci, as 
well as for an elucidation of the immunogenic regions of polypeptides of the 
invention. 

It is also possible to use fragments of the antibodies described above, for 
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example, Fab fragments. Antibodies generated to a peptide of the Invention may 
be administered to an individual to treat GAS infeaion by passive immuno 
therapy. The antibodies of the invention may be formulated with a 
pharmaceutically acceptable carrier and delivered in the same way as set out above 
5 for the vaccine compositions. Preferably the antibody is administered in an 
amount effective to ameliorate GAS infection in the individual. 

The following Examples illustrate the invention. 
Examples 

Experimental procedures used in the following examples 

1 0 Bacterial cultures 

The S,pyogen€S strains used in this study (serotypes Ml, M4, M9, M12, 
M49) are from the World Health Organization Centre for references and research 
on Streptococci, Institute of Hygiene and Epidemiology, Prague, Czech Republic. 
The S.pyogenes strain SF370 is being sequenced in the Streptococcal Genome 

15 Projea, and can be obtained from ATCC (700294). Streptococci were grown in 
Todd-Hewitt broth (Difco, Detroit, MI), supplemented with 0.2% yeast extract 
(Difco) in 5% CO^ at Exoli strain BL21 (Pharmacia Biotech, Uppsala, 

Sweden) was grown in Luria-Bertani or 2X YT broth or agar, aerobically at 37° C 
and supplemented with 100 A^g/ml ampicillin (Sigma, St Louis, MO) or glucose 

20 where appropriate, 

PCR, cloning procedures^ and sequencing 

Chromosomal DNA from S,pyogenes strains was extracted as described in 
Pitcher et al, Lett. App. Microbiol 1989 8 pl51-156 modified in the initial 
incubation step by addition of 1000 U/ml of mutanolysin (Sigma) and 100 mg/ml 

25 lysozymc (Sigma) « Oligonucleotide primers were designed by using sequence 
information from the Streptococcal Genome Project database, together with 
published sequences from other Streptococcus spp. Primers 5'-TAG-TAG-CGA- 
ATT-CGT-CGA-CTG-GCG-CTA-3' and 5^AGC-ACA-ACT^CGA-GAA-TCG- 
CTG-TGC-TTT- A-3 ' enclose almost the whole of mtsA (excluding the signal 

30 peptide and the NHj-terminal cysteine residue) and were designed with an EcoRl 
and Xhol restriction site, respeaively. Primers S'-GAT-TAC-AACTAA-CAA- 
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TCT-TTG-TGT-G AC-C-3 ' , 5 '-TTG- AC A-AGaTAT-CAA-C AG-TAA-ATA^ 
CCT^C-3\ 5'-ATG-TCA/T-CTC/T-ATG'GGA/G/T-GAT-GCC-ATC-3', and 
5'-TTA/G-GCA-TAT/G-AG/AA-TAA/G-GCC/T-GTC^GC-3' were designed 
from internal segments of the genes mtsB and mtsC. PGR experiments were 
5 performed using Taq polymerase (Gibco-BRL, Gaithersburg, MD), except for 
cloning purposes, when TaqPlus Precision^ (Strategene, La Jolla, CA) was used. 
The PGR product corresponding to mtsA was gel-purified prior to cloning, using 
Sephaglas™ Bandprep Kit (Pharmacia Biotech). The PGR profile consisted of 30 
cycles at 94'^C for 1 min, 50°C for 1 min, and 72^C for 2 min, followed by aiinal 

10 extension at 72 ''C for 7 min. Plasmid purification, restriction enzyme digestions, 
ligation, electroporation and screening of transformants were all performed 
according ro standard procedures, or when applicable, according to instructions in 
the GST fusion protein kit (Pharmacia). Sequencing of the cloned insert was 
performed on an ABI-470 Prism with dyed dideoxy terminators, at Innovagen AB, 

1 5 Lund, Sweden- 

Overexpression and purification of recombinant MtsA 

. An E,coli strain carrying the recombinant plasmid pGEX-5X-3:?ni3i4 was 
grown at 37*'C overnight. This preculture was then inoailated 1:100 in 
prewarmed 2 x YT containing 100 ,ug/ ml ampicillin. The culture was grown at 

20 30°C until OD^oo ^ L2. Induction was then started by adding 0.5 mM IPTG 

(Promega, Madison, Wl). After an additional 4 h of incubation the bacteria were 
pelleted by centrifugation at 8000 x g for 10 min, washed, and resuspeaded in PBS 
buffer, and lysed by sonication with a Branson sonifier B15 (Heimstatt, Germany). 
Triton X-100 was added to a final concentration of 1%, and the sample was gently 

25 mixed for 30 min. Cell debris was pelleted by centrifugation at 12000 x g for 10 
min, and the supernatant was applied to a pre-equilibrated glutathione-sepharose 
suspension. All washes were performed in batch format, by centrifugation at 1000 
X g for 5 min. GST:MtsA fusion protein was eluted with reduced glutathione. 
Alternatively, factor Xa cleavage (Pharmacia Biotech) was performed in 50 niM 

30 Tris-HCl, pH 7.5, 150 mM NaCl, 1 mM GaCl., overnight (1 U factor Xa/ml 

bacterial sonicate supernatant). Cleaved MtsA protein was then eluted in repeated 
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sceps with PBS. 
RNA methods 

Total RNA from S.pyogenes was purified using Fastprep™ cell disrupter 
(Savant, Holbrook, NY). Briefly, bacteria were cultured in THY medium until 
5 mid-logarithmic or early stationary phase, harvested by centrifugation at 3,800 x g 
for 10 min at 4°C, and resuspended in PBS, followed by disruption for 2 x 30 
seconds at setting 6.0 using FastRNA™ kit with glass beads (BIO 101, Vista, CA) 
according to the manufacturers instructions. 

For Northern blot experiments, RNA was separated on 1% agarose in 1 x 

10 HEPES buffer, blotted onto Hybond-N filters (Amersham, Amersham, UK), and 
hybridized with 600-900 bp long DNA probes specific for mtsA, mtsB and mtsC 
(see above). The PGR products (see above) were purified on a MicroSpin™ S-200 
HR column (Pharmacia, Uppsala, Sweden), and radiolabelled with [a-^^]dATP 
using the Megaprime^^^ kit (Amersham). To verify that equal amounts of RNA 

1 5 were present on the filter, it was also probed with a radiolabelled 800 bp probe 
specific for 16S rRNA. 
Protein methods 

Protein samples were separated by 12% SDS-PAGE, Gels were then 
soaked in blotting buffer (20% ethanol, 200 mM glycine, 25 mM Tris), and 

20 proteins were blotted to an Immunobilon-P'^ PVDF-membrane (Millipore), 

Membranes were then blocked in 15 ml of PBS, 0,05% Tween>20, 5% (w/v) skim 
milk for 20 min at room temperature. The primary antibody (see below) was 
added, diluted 1:1000 in PBS, 0.05% Tween-20 (PBST), and the membrane was 
incubated at 37 for 30 mins. Membranes were then washed in PBST, 3x5 min 

25 at 37 °C. Horseradish peroxidase-conjugated antirabbit goat antibodies (Bio-Rad, 
Bio-Rad laboratories CA) diluted 1:3000 in PBST were added and the membrane 
was incubated for 30 min at 37''C, and then washed as above. 

Proteolytic digestion of bacteria was performed on overnight cultures of S. 
pyogenes. Bacteria were pelleted by centrifugation at 3000 x g for 5 min, at 8^C, 

30 resuspended in cleavage buffer (0,01 M Tris-HCl, pH 8), and washed twice as 

above before being resuspended in cleav^e buffer. Papain (Sigma) was added (0 - 
200 ^g/2 X 10' bacteria), and digestion was initiated by the addition of L-cysteine 
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(55 mM). The suspension was incubated at for 1 h. The digestion was 
stopped by adding lodoacetamide to a final concentration of 12 fiM, Bacteria were 
then pelleted by centrifugation at 4800 x g for 10 min, and the supernatant was 
removed and filtered through a 0.2 /im Acrodisc (Gelman Sciences, Ann Arbor, 
5 MI). After freezing and thawing, samples precipitated spontaneously. This was 
used as a convenient method of concentration, as SDS-PAGE analysis indicated 
that the precipitate contained all the proteins present in the original solubilized 
sample. Enzymatic digestion with streptococcal cysteine proteinase was 
performed essentially as described in Bej^e and Bjorck, J. Biol Chem (1995) 270 
10 9862-9867. 
Metal Assays 

Samples for PIXE analysis were prepared as follows. One ml of GST, 
MtsA fusion, and MtsA suspended in PBS was ultrafiltrated in Centricon-10 cells 
(Amicon, Inc., Beverly, MA) four times, each time reducing the volume to less 

15 than one tenth, and adding Millipore water up to L5 ml after the first three spins. 
The final protein concentration was determined using Coomassie® Protein Assay 
Reagent kit (Pierce, Rockford, IL). Bovine iron-saturated holo-Transferrin (Sigma) 
was suspended in Millipore water. Then, 20 and 60 ^g of GST, MtsA fusion and 
MtsA, respectively, in 25 ^1 of Millipore water was added to a Kimfoil (Kimberley 

20 Clark), mounted on a plastic frame, and allowed to dry. 200 fig of transferrin in 

25 fil of Millipore water was similarly prepared. At the Lund Nuclear Microprobe 
the samples were placed in the vacuum irradiation chamber and bombarded with a 
2 nA proton beam, having an energy 2.55 MeV, accumulating a beam charge of 0.6 
/xC. The characteristic X-rays emitted were detected using a Kevex 50 mm' Si(Li) 

25 X-ray detector, while data were collected using a C AMAC/Mac-computer system 
equipped with KMAX (Sparrow) software. Elemental standards (Fe, Co, Ni) were 
analysed in the same batch to verify quantification. 

Gel filtration experiments were performed with PD-10 columns (Pharmacia 
Biotech). 20 fig protein (GST or MtsA) in PBS, supplemented with 0,25 M NaCl, 

30 was incubated overnight at 4*^0 with approx. 0.5 ^Ci of ^Mn, ^^Zn or '^e 

(Amersham). The sample was applied to a gel filtration column, preequilibrated 



WO0<V40729 PCT/GB99/a4445 

-23- 

with PBS> 0.25 M NaCl> and 0.5 ml fractions were eluted with the same buffer. 
Each fraction was then assayed for protein content, using Coomassie® Protein 
Assay Reagent kit (Pierce), and radioactivityj by adding Ready Safe™ scintillation 
fluid (Beckman Instruments, Fullerton, CA) and counting ^-emission on a 
Beckman Instruments LS6000TA. 

Slot blot experiments were performed by applying 10, 1 and 0.1 of 
protein (MtsA fusion and GST) to a nitrocellulose membrane, using a slot blot 
• apparatus (Millipore). Membranes were equilibrated in 0.1 M citrate buffer, pH 
6.2, (appr. 0.1 M Na"), or 0.1 M maleic acid, pH 6.5, 0.1 M NaCl (for iron- 
binding) and then suspended in the same buffer with 1 fiCi/ml of radioisotope 
(^'Fe, ^'^Mn or ^^Zn), for 1 h at room temperature and with gentle mixing. In 
competition experiments 1 mM of Fe(n) sulfate, Fe(IIl) citrate, MnClj, CuCl2 or 
ZnClj was added. Then, membranes were washed for 2 x 15 min in buffer, and 
exposed on a phosphoimaging plate (Fuji Photo Film Co. Ltd., Japan). 
Quantification of bound radioaaivity was performed using the Bio-Imaging 
Analyzer BAS2000 program package (Fuji Photo Film). 
Other Met/yods 

NH^-terminal sequencing (Edman degradation) was performed at the 
Biomedical Service Unit, Lund University. The peptide QDPHEYEPLPEDV 
SEQ ID No 9 was synthesized, analysed for purit)'' and correct sequence. 

Rabbits were immunized with peptide-KLH conjugates. Primary 
immunization was performed with 100 /xg peptide-KLH conjugate in Freunds 
complete adjuvant (Sigma). The two booster immunizations were done at weeks 4 
and 6, using 100 fi% peptide-KLH conjugate in Freunds incomplete adjuvant 
(Sigma). 

Example 1 - Identification and Sequence analysis of a 5. pyogenes member of the Iral 
family 

Predicted amino acid sequences of the proteins encoded by the various Iral 
operons were used to search (tBLASTn) the Streptococcal Genome Project 
database, at the time of 95% completion. The products of four adjacent open 
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reading frames (ORF), showed strong homology to the Lral proteins. A frame 
shift split one gene (lipoprotein) in two ORF's but subsequent sequencing {see 
below) of a serotype Ml strain showed that the database sequence was incorrect, 
containing a single base insertion in a region of reduced database sequence quality. 
The (corrected) three ORFs were named mtsABC (streptococcal metal 
transporter). 

The three ORFs encode proteins typical of an ABC transport system, 
MtsA contains a putative hydrophobic signal peptide and a consensus sequence 
{LXXQ typical of NH^-terminal lipid linkage bacterial lipoproteins. MtsB has an 
ATP-binding cassette, while MtsC is a highly hydrophobic protein with 6-7 
potential transmembrane domains. Homologies between the putative proteins of 
5. pyogenes and their counterparts in other Streptococcus spp. were in the same 
range as pre^aously described for the lral family, as noted in the Table below. The 
percentages indicate sequence identity at the protein level. An indicates only 
partial sequences available. 

Bacterium Lipoprotein ATP-binding protein Integral membrane protein 



S.pneumoniae 72% 51%* 70% 

S.gordonii 73% 56% 78% 

S.parasanguis 72% 54% 75% 

S.sanguis 74% 75%"- 

Sxrista 72% 73%'-^ 

E.faecalis 55% 



Notably, the arrangement of the genes in the 5. pyogenes operon was 
atypical. The lipoprotein gene is at the 5' -end of the operon, whereas in all other 
operons it is at the 3 '-end. A publication on an oligopeptide permease ABC- 
transporter in 5. pyogenes Polbielski et al Mol. Microbiol. 1996 21(5) 10874099 
showed a differential transcription of the lipoprotein component. A putative 
stem-loop structure (-60.2 kj mol ') (Fig. IB) similar to Rlio-independent 
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transcription terminators was identified in the 66 bp non-coding region between 
MtsAmdMtsBC. 

The atypical organization was verified by PGR experiments. Forward and 
reverse primer pairs from each of the three genes were used to ampUfy products 
5 with template DNA from five different strains (serotypes Ml> M4, M9> and 
M49) of S. pyogenes, in combinations so as to allow every possible arrangement of 
the three genes in the operon. All experiments gave single products of a size 
consistent only with the atypical operon seen in the database (data not shown). 

The GENBANIC database was searched for homologues of MtsA, MtsB and 

10 MtsC, using the BLAST algorithm. The highest scoring matches were all ABC 
transporters specific for di-or trivalent cations. Specifically> several iron- 
siderophore transport systems showed a high degree of homology to the 
streptococcal proteins. MtsA has a 34% identify with YfeA (Bearden etalj, 
Bacteriol 1998 180(5) 1135-1147), the periplasmic binding protein of an iron 

1 5 (chelated) transport system in Yersinia pestis (also in Haemophilus influenza^. An 
inversely directed search, using YfeA to search for homologues in the 
Streptococcal Genome database, identified MtsA as the best homologue on the 
streptococcal genome. TroA (Hardman etal Gene 1997 197 47-64) from 
Treponema pallidum also shows homology (27% id. and 17% sim.) with MtsA, and 

20 is part of an ABC transporter operon flanked by iron-regulated transcription 
factors. Additionally, a recently described iron-regulated ABC transporter in 
Staphylococcus epidermidis (Cockayne et al Infect Immun 1998 66 3767-3774} has a 
high degree of homology (the 5. epidermidis lipoprotein is 52% identical to MtsA) 
CO Iral protein, and should possibly be included in this family. 

25 Two homologous ABC transporters not involving iron were also found: 

the manganese transporter MntABC (Bartsevich and Pakrasi EMBO J 1995 14(9) 
1845-1853) from Synechococcus cystisy and the zinc transporter AdcCBA from 5. 
pneumoniae^ where MtsA showed appr. 30% identity with the corresponding 
proteins, MntC and AdcA. A comparison of MtsC with conserved motifs of 

30 integral membrane proteins from ABC transporters (Saurin etalMol Microbiol 
1994 12 993-1004) showed that the best fitting (21% id. and 21% sim. on 43 aa) 
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motif was from the cluster of iron-siderophore transporters. 



Example 2 - Transcriptional analysis of mtsABC 

In order to investigate the transcription of the three genes in the 5. pyogenes 
5 Iral operon, probes corresponding to internal sequences of mtsA^ mtsB and mtsC 
were produced and labelled. Total RNA was extracted from serotype Ml S. 
pyogenes bacteria grown to niid-logarithmic and early stationary phase. Northern 
blot experiments (data not shown) showed that all three probes reacted weakly 
with a transcript approximately 2.5 kb in size when RNA from midJog phase 

10 bacteria was used, consistent with a polycistronic transcription of the operon. 

However, the tntsA probe also reaaed with a shorter transcript, approximately 1 
kb In size. This shorter transcript was present in higher (10 -20 times) amounts 
than the polycistronic transcript, suggesting that the stem-loop struaure in the 
non-coding region between mtsA and mtsB can terminate transcription. The MtsA 

15 probe showed weak reaaivity with the short transcript when RNA from bacteria 
in early stationary phase was used, whereas the two other probes did not hybridi^ 
at all. A control hybridization with a probe for 16S rRNA was done to verify that 
RNA levels in the samples (mid-log and early stationary phase of growth) were 
equal (data not shown). Taken together, these results indicate that the lipoprotein 

20 is expressed in higher quantities than the ATPase and hydrophobic membrane 
protein. 

Example 3 - Surface localization of the MtsA lipoprotein 

We produced the synthetic peptide QDPHEYEPLPEDV^ spanning a 

25 highly conserved region of MtsA, predicted to have good antigenicity. A rabbit 
was immunized with the peptide, and following three boosters the senun showed 
good reactivity with the peptide in ELISA, as compared with preimmune serum 
(data not shown). Then, a proteolytic digestion of 5. pyogenes bacteria was 
performed, to investigate whether a protein fragment from the lipoprotein could 

30 be identified using the peptide antiserum. The protein fragments released by 

proteolytic digestion were subjeaed to SDS-PAGE and then transferred to PVDF- 
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membrane by Western blotting. 

Immunodetection with the peptide antiserum identified a protein fragment 
With an apparent molecular mass of 36 kDa, solubilized at high concentrations of 
papain (data not shown). The protein seemed fairly resistant to proteolytic 
digestion, since it remained at the same position even at high papain 
concentrations when most of the proteins had shifted to the low molecular weight 
rangej supposedly degraded by the excess of protease. Protein from the 36 kDa 
band was subjected to NHj-terminal amino acid sequencing. The result 
{KSDKLKWAT, aa HO of the 36kDa papain fragment) showed a 90% identity to 
a region very close to the NH^-terminus in the predicted MtsA protein (amino 
acids 30-39) of the database (ESDKLKVVAT), and a 100% identity to the predicted 
MtsA protein (amino acids 30-39) from the sequence of the strain studied (see 
below). The predicted molecular mass of the mature MtsA polypeptide was 32 
kDa. Thus, papain seems to cleave MtsA very close to the protein's predicted 
NHj-terminal lipid anchor, liberating almost the whole polypeptide from the 
bacterial cell surface. A similar papain digestion and western blot was also 
performed with the strain SF370 sequenced in the Streptococcal Genome Project, 
with the same result. 

Bacterial growth media from overnight cultures of 5. pyogenes was also 
examined for presence of MtsA, since bacterial lipoproteins are sometimes found 
in medium as welL No reactivity with the peptide antiserum was seen in TCA- 
precipitated proteins from medium. Also, mild detergent treatment of bacteria 
failed to solubiUze any protein reacting with the antiserum. 

5. pyogenes produces and secretes a cysteine proteinase, SCP. This protease 
has previously been shown to release functionally active fragments of streptococcal 
surface proteins. We performed proteolytic digestions of 5. pyogenes with purified 
SCP. Solubilized proteins were visualized by SDS-PAGE and analysed by 
immunoblotting. There was no reactivity with the antiserum. To exclude that 
MtsA had been completely degraded, a double digestion was performed, first with 
SCP and then with papain. The amount of MtsA released, as judged by staining 
and immunoblotting, was unaffected by pretreatment with SCP (data not shown). 
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Example 4 - Cloning and purification of recombinant MtsA 

The mtsA gene from strain API was PCR-amplified and cloned into pGEX 
and sequence analysis confirmed the presence oiMtsAy showing 98% (amino acid) 
Identity with the database sequence, well in Une with what could be expected from 
5 two different strains of the same serotype (compare psaK from S. pneumoniae 

Berry and Paton Infea. Immun. 1996 64(12) 5255-5262. There was no frame shift 
in the sequenced ORF, indicating that the preliminary database sequence indeed 
contained an incorrect base insertion (see above). Also, the supposed error is 
found in a region where the database sequence is less accurate. MtsA was purified 
10 by affinity chromatography of over expressed MtsA fusion and subsequent 

proteolytic cleavage with factor Xa. The purified MtsA had an apparent molecular 
mass of 36 IcDa, and comprised > 95% of the protein content in the sample (data 
not shown). 

Computer predictions (Robson-Garnier and Chou-Fasman algorithms) of 
1 5 MtsA secondary structure suggest a predominantly a-helical structure (43-60% of 
the protein). CD-spearoscopy of MtsA confirmed these predictions (data not 
shown). 

Example 5 - Analysis of trace element content and metal binding properties of MtsA 
20 A recent publication on a periplasmic molybdate-binding protein of E. coli 

Rech etal]. Biol Chem 1996 271 2557-2562 showed that binding of the ligand 
changed the migration of the protein in native PAGE. Similar experiments were 
performed with MtsA, but no consistent effects on migration properties was seen 
with any of the tested potential ligands (Mn(II), Fe(III), Cu(II), Zn(II), Co(Il), 
25 Ca(II)), nor with EDTA (data not shown). We performed a quantitative analysis 
of trace element content in MtsA, MtsA fusion, GST and (iron-saturated) 
transferrin by the use of a highly specific and sensitive technique, proton-induced 
X-ray emission (PIXE). PIXE analysis indicated that iron was present in 
approximately 1.5 molar ratio compared to protein in the MtsA sample (Fig. 2). 
30 Little or no iron was detected in the other samples, except for transferrin. Among 
other trace elements only copper was present in significant amounts. Both MtsA 
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fuslon and MtsA contained copper in approximately 0.5 molar ratio to protein. In 
a similar analysis, using a 0.1 M Tris-Ac buffer, pH 7.5, a significant Zn content 
was found (data not shown). 

In addition, 30 solutions of GST and MtsA were analysed for iron 
content at a routine clinical chemistry laboratory. The sample with MtsA 
contained 20 /iM iron, indicating a 67% saturation of the protein. To rule out that 
iron contamination from the factor Xa solution used for cleaving the fusion 
protein was affecting the results, a sample containing cleavage solution, reduced 
glutathione, and PBS was analysed for iron content. Iron concentration in that 
sample was <5 ^M (detection limit). 

MtsA was subjected to proteolytic digestion with trypsin, papain, and 
proteinase K (5:1 ratio SmtA/pro tease). Digestion patterns of MtsA with/ without 
pretreatment with EDTA were compared, to elucidate whether a potentially 
present cation ligand affected the conformation of the protein so as to change 
accessibility of proteolytic cleavage sites. No such effect was seen (data not 
shown). A relative resistance to proteolytic digestion with papain was noted, in 
accordance with the case of the native protein (see above). In addition a high 
degree of trypsin resistance was noted. 

We chose to investigate direa binding of metal radioisotopes ('^Fe, ^^Zn 
and ^Mn) to the recombinant protein. Iron proved to be notoriously difficult to 
work with, since ferric iron is essentially insoluble at neutral pH. Precipitation 
problenfis therefore prevented reasonable interpretation of several assays tried. 
Copper radioisotopes decay too rapidly to handle practically. Interestingly, an 
interaction with Zn could be demonstrated. GST and MtsA were incubated with 
^Zn, in the presence of 0.25 M NaCi, and then subjected to gel filtration. 
Fractions were collected, and their protein concentration and radioactivity was 
measured. The result (Fig, 3 A) indicates comigration of ^^Zn with MtsA, but not 
with GST. When the molar amount of comigrating ^^Zn is compared with the 
protein content, a 60% saturation of the protein is found (assuming a single 
binding site). Similarly incubated GST and MtsA were subjeaed to native PAGE, 
and then autoradiography. Radioactivity was seen at the place of MtsA, but not 
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for GST (data not shown), and this could be inhibited by adding ZnCl2 to the 
initial incubation. 

A slot blot assay was also performed, applying MtsA and GST onto a 
nitrocellulose membrane in dilution series. The membrane was then incubated 
with ^^Zn, and, following washing steps, radioactivity was found in the MtsA 
fusion sample, but not in the GST sample (data not shown). This interaaion 
could be efficiently inhibited (88%) by addition of 1 mM ZnCl2 during incubation. 
A varying degree of inhibition was seen when adding other metal salts. Among 
these, only Cu(n) inhibited the interaaion to a high degree (74%). A similar 
experiment was performed with ^'Fe(III), which also bound to MtsA but not to 
GST (data not shown). The binding was completely inhibited by addition of 1 
mM Fe(lll) citrate, but only weakly inhibited by 1 mM CuCl^ and MnCli- 
Addition of Fe(Il)-sulfate or ZnClj caused imspecific interaction with the 
membrane, probably due to precipitation. All these assays were also performed 
with ^^Mn, but no interaction between the radioisotope and any of the proteins 
was found. 

Example 6 - Gene and protein prevalence 

Genomic DNA was isolated and subjected to PGR, using the primers as 
listed, and an annealling temperature of 45 degrees. The primers u5ed for PGR 
amplification of MtsA gene included: 

LUND MTSA 5'^TAG-TAG.CGA-ATT-CGT^CGA-GTG-GCG-CTA-3' 
(SEQIDNo3) 

MTSA 4 5'^AGC-ACA-ACT-CGA'GAA-TCG-CTG-TGC-TTT-A-3' 
(SEQ ID No 4) 

The PGR reagents and reagent volumes used for each sample were as follows: 
IOa^I GeneAmp lOX PGR Buffer U & MgCl^, LUND MTSA primer \QuU, 
5u\ MTSA 4 primer lO.uM, DNTP mixture 2.5mM (dATP, dCTP, dGTP, 
dTTP), 67.5iul dH^O, Baaerial DNA template, 0,5^1 AmpliTaq Gold DNA 
polymerase. The thermocycler PGR settings were as follows: A. 94 "^G for 10 
minutes, B. 95°G for 1 minute, G. 45''G for 1 minute, D. 74°G for 1 minute, E. 
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74 ""C for 7 minutes, F, 0**C cubes stored indefinitely. Steps B-D were cycled 35 
times. All 23 strains have deteaable PGR produa in the prediaed size range. The 
results are set out in Table I below. 

5 Sequence conservation 

The MtsA PGR products from three strains of GAS have been sequenced, 
resulting in approximately 750bp from each. The sequences showed a liigh level of 
conserx'^ationj compared to the sequence of the original API strain. The strains 
1 0 sequenced were Ml AP 1, Tl 1 AP74 and Ml 3686-98. The sequencing primers 
were identical to those used for the PGR reaction. 

Example 7 - Generation of antiserum 

From the genbank MtsA sequence, a number of peptides have been selected 
1 5 based on predicted antigenicity. These are as follows: 

Name Sequence (including terminal C) length 

SpyLP^TDSll (259-280) C^^TDSIAKKGKP GDSYYAMMKW N-GOOH 22 

Spy-LP-ESSl6 (234-250) G^ESSVDRRPME TVSKDS-COOH 17 

20 Spy-LP-KQL17 (136-153) C^'-KQLIAKDPKN KETYEKN-GOOH 18 

Spy-LP^EIN 19 (204-223) G*EINTEEEGTP DQISSLIEK-GOOH 20 

A further peptide, QDP13, has also been highlighted as a potential vaccine 
candidate, based on it's ability to raise antisera in rabbits which reacts with the 
25 MtsA protein (data not shown). 

QDP13 QDPHEYEPLPEDV 13 



30 



The first four peptides have been conjuguated to KLH and used to immunize two 
sheep each as follows: the peptides were derived from peptide synthesis by 
Genosys Biotechnologies Inc. They are supplied as conjugated lyophilized powder 
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in 0.5mg aliquots in capped vials. The peptides were conjugated to keyhole limpet 
hemocyanin. C*^ is a cysteine insert for attachment to a hetero bifunctional linker. 

In addition, there were BSA-conjugates of the peptides. They were stored 
as lyophilized powder in 0.5mg aliquots in capped vials. The BSA-conjugates were 
used for ELISA plate immobilization/evaluation. 

For each immunisation the following regent mixture was administered: 

• A fixed volume of 3.25 ml Freund's complete/incomplete adjuvant. 

• Img of peptide conjugate in 1.3 ml volume of saline for the primary 
immunisation at day 0 or 0.5mg of peptide conjugate in 1.3 ml volume of 
saline for the subsequent immunisations. The immunisation mixture was 
achieved by emulsifying the peptide<onjugates in saline with the Freund's 
adjuvant. The immunisation mixture was injeaed into 6 subcutaneous sites 
for each sheep. 

These antisera have been tested by ELISA for their abiUty to bind to the 
immunizing peptides. Peptide was coated onto microtitre plates (100^^1/ well) at a 
concentration of 5^ig/ ml, in 0.05M carbonate-bicarbonate buffer pH 9.6. The 
plates were incubated for 1 hour at 37°C, The plates were then washed x5 with 
PBS-T (250Ml/well) and blocked with 1% BSA/PBS-T (lOOiul/well) for 1 hour at 

After washing the plates x3 with PBS-T, pre and post immune sera from 
sheep immunised with peptide conjugate vaccine candidates including FCA/Spy- 
LP.TDS21-KLH, FCA/Spy-LP-ESS16-KLH, FCA/Spy-LP'KQL17-KLH and 
FCA/Spy-LP-EIN19^KLH were diluted 1/10,000 with PBS-T. The sera were then 
incubated on plates coated with the corresponding peptide (lOOul sera/well) for 1 
hour, at 37''C. The plates were washed x3 with PBS-T and incubated with donkey 
anti-sheep IgG/peroxidase conji^ate (1/1000 in PBS-T) for 1 hour at 37''C. After 
washing x5 with PBS-T, the plates were incubated with O.lmg/ml TMB substrate 
(lOOAil/well) for 10 minutes and then the reaction was stopped with 2M H^SO^ 
(50.ul/well). Absorbances were read at 450nm. 

Figure 4 shows that both sheep immunized with each peptide raised strong 
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antisera against the corresponding peptide (eight sheep in total). Figure 5 shows 
that the anti-peptide antisera also bound to whole MtsA protein, demonstrating 
the ability of the peptides to raise antisera which may bind to natural protein on 
cells. That this binding is specific was demonstrated in a peptide blockii^ ELISA 
5 experiment, where peptide was titrated into the sera prior to reacting the sera in an 
EUSA against whole protein (figure 6). 



Example 8 - Western Blot analysis 

Various strains of S.pyogenes were tested by Western Blot> using the sheep 

10 anti-TDS 21 antisera to detect the protein. In particular, Western Blots from SDS- 
PAGE oi S.pyogenes proteins were incubated with 30 ml 5% (W/V) milk 
powder/PBS as a blocking solution. iOiA (l/lOOO) sheep antiserum (anti-TDS21) 
was added in 30 ml 5% (W/V) milk powder PBS and incubated for 60 mins at 
room temperature. After 3 washes, 3.0 fA (1/10000) of donkey anti-sheep IgG 

15 peroxidase in 30 ^1 5% (W/V) milk powder/PBS was added and incubated for 60 
minutes, washed and peroxidase substrate added. All 23 strains had detectable 
protein. 

The results along with the PGR results are set out below. 



20 



25 



30 
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Table 1 - MTSA Gene and Protein Distribfution in group A Streptococcus, as 



determined by PGR analysis and Western Blot. 



20 



Group A 
Strain 
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No 
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gene 
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+ means genomic DNA PGR product or protein was deteaable, TBD to be 
determined. 

Example 9 - Immunisation studies 
5 The Spy-LP-ESS16 peptide was also used to immunize rats. The rats were 

treated intraperitoneally with a dose volume of Iml/animal. Control groups 
received 1ml saline, lOO^^g keyhole limpet hemocyanin (RLH) in 1ml saline, 500/^1 
of Freund's complete adjuvant (FCA) plus 500/il saline or Img aluminum 
hydroxide in 1ml total volume. Peptide was administered at 100//g/ml. Rats were 

1 0 given a boost on day 21 and bled on day 42. Plates were coated with peptide at 
5A^g/ml in carbonate buffer. Serum was titrated against the peptide in 10 fold 
dilutions from 1 in 100 in PBS and binding detected using an anti-rat HRP 
conjugate and TMB substrate. To summarize this data, the control sera, sera from 
rats injected with free peptide in the absence of adjuvant, sera from rats injeaed 

15 with free peptide in FCA, and sera from rats injected with free peptide in 

alhydrogel (alum) did not react to the ESS 16 peptide bound to plates. In contrast, 
sera from rats injected with peptide KLH conjugates or peptide KLH conjugates in 
alum or CFA all contained substantial antibody titres. 

20 Example 10 - Analysis of natural human immunity 

The TDS21 peptide was chosen for further studies of endemic human 
immunity. Free TDS21 peptide was coated onto ELISA plates and reaaed with 
sera obtained from Thai volunteers. The Thai population under study has a high 
prevalence of infection with GAS, and a high incidence of rheumatic heart disease 

25 (RHD). Normal and RHD responses were compared. The results showed clearly 
that both normal and RHD sera may contain antibodies to TDS21, and in fact the 
incidence of such antibodies appears to be higher in the normal population (data 
not shown). Thus> the TDS21 epitope is recognized during natural exposure to 
GAS. 

30 Similarly, the TDS21 and QDP13 peptides were used to stimulate 

lymphocytes from the same Thai individuals. As with antibodies, the TDS21 
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response was more common in the control than the RHD population. QDP13 
stimulated only cells from RHD individuals. 
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CLAIMS 

1 . A polypeptide which comprises: 

(a) the amino acid sequence of SEQ ED NO 1, 

(b) a variant of (a) which is capable of binding an anti-MtsA antibody, 
or 

(c) a fragment of (a) or (b) of at least 6 amino acids in length which is 
capable of binding an anti-MtsA antibody. 

2. A vaccine composition comprising a polypeptide which comprises: 

(a) the amino acid sequence of SEQ ID NO 1, 

(b) a variant of (a) which is capable of generating an immune response 
to a Streptococcus or 

(c) a fragment of (a) or (b) of at least 6 amino acids in length which is 
capable of generating an immune response against a Streptococcus. 

3. A vaccine composition according to claim 2 wherein the Streptococcus is a 
group A Streptococcus. 

4. A polynucleotide having a sequence which is 

(i) the nucleotide coding sequence of SEQ ID NO 1 or the sequence 
complementary thereto, 

(ii) a nucleoride sequence which selectively hybridises to a said sequence 

(i) or a fragment thereof, or 

(iii) a nucleotide sequence which codes for a polypeptide having the 
same amino acid sequence as that encoded by a said sequence (i) or 

(ii) . 

5. A polynucleotide according to claim 4 wherein the sequence (i), (ii) or (iii) 
encodes a polypeptide capable of generating an immune response to a 
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Streptococcus. 

6. An expression vector comprising a polynucleotide according to claim 4 or 5 
operably linked to a regulatory sequence, 

7. A host cell transformed with a polynucleotide of claim 4 or claim 5. 

8. A process of producing a polypeptide suitable for use in vaccination against 
a Streptococcus comprising maintaining the host cell as defined in claim 7 
under conditions to provide expression of the polypeptide and recovering 
the expressed polypeptide. 

9. An antibody specific for a polypeptide as defined in claim 1, 

10. A method of vaccinating a patient against a Streptococcal infection, which 
method comprises administering to the patient an effective amount of a 
polypeptide as defined in claim 1 or a vaccine composition according to 
claim 2 or 3. 

11. A vaccine composition comprising a polypeptide as defined in claim 1 or 
claim 2 and a pharmaceutically acceptable carrier. 

12. A polypeptide according to claim 1 or a vaccine composition according to 
claim 2 for use in generating a protective immune response in an individual 
to a streptococcus. 
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SEQUENCE LISTING 

<110> ACTINOVA LTD 
<120> PROTEIN 
<130> 

<150> 6B 9828880.6 
<151> 1998-12-31 



<160> 9 



<170> Patentin Ver. 2.1 

<210> 1 

<211> 893 
<212> DNA 

<213> Streptococcus pyogenes 



<220> 
<221> CDS 
<222> (1)..(861) 



<400> 1 

teg teg act ggc get aaa act get aag agt gat aaa tta aaa gtc gtg 48 
Ser Ser Thr 61 y Ala Lys Thr Ala Lys Ser Asp Lys Leu Lys Val Val 
15 10 15 

gca acc aat tea att att gee gac atg aea aaa get att get ggt gat 96 
Ala Thr Asn Ser He He Ala Asp Met Thr Lys Ala He Ala Gly Asp 

20 25 30 

aaa ate gat ctg cac age att gtg eca ate ggt caa gac cct eat gag 144 
Lys He Asp Leu His Ser He Val Pro He Gly Gin Asp Pro His Glu 
35 40 45 
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10 



15 



20 



25 



tac gaa 
Tyr Glu 
50 

att ttc 
He Phe 
65 

acc aaa 

Thr Lys 

gcc gtg 
Ala Val 

aaa gga 
Lys 61 y 

att tat 
He Tyr 
130 
aac aaa 
Asn Lys 
145 

gaa aaa 

Glu Lys 



cca tta 
Pro Leu 

tat aat 
Tyr Asn 

eta gtg 
Leu Val 

tct gat 
Ser Asp 
100 

aaa gaa 
Lys Glu 
115 

tea aaa 
Ser Lys 

gaa act 
Glu Thr 

eta gae 
Leu Asp 



cca gaa 
Pro Glu 

ggt ate 
Gly He 
70 

aaa aat 
Lys Asn 
85 

ggc att 
Gly He 

gat cca 
Asp Pro 

aac att 
Asn He 

tat gaa 
Tyr Glu 
150 
aaa gaa 

Lys Glu 
165 

att gtg 
He Val 



aat aaa aaa ttg 
Asn Lys Lys Leu 
180 

aaa get tac ggt gtc cca 
Lys Ala Tyr Gly Val Pro 
195 

gaa gaa gga aca cca gat 



gat get gaa 

Asp Ala Glu 
55 

aat eta gaa 

Asn Leu Glu 

get caa aaa 
Ala Gin Lys 

gat gtg att 
Asp Val He 
105 

cat get tgg 
His Ala Trp 

120 
gee aaa caa 
Ala Lys Gin 
135 

aag aac eta 
Lys Asn Leu 

gee aaa tea 
Ala Lys Ser 

act agt gaa 
Thr Ser Glu 
185 

tct get tat 
Ser Ala Tyr 

200 
caa att tea 



aaa aca agt 
Lys Thr Ser 
60 

gat ggc ggg 
Asp Gly Gly 
75 

aeg aaa aac 
Thr Lys Asn 
90 

tac ttg gaa 
Tyr Leu Glu 

tta aat etc 
Leu Asn Leu 

ttg att gca 
Leu He Ala 
140 

aaa get tat 
Lys Ala Tyr 

155 
aaa ttt gat 

Lys Phe Asp 
170 

ggc tge ttc 
Gly Cys Phe 



aat get gat gtg 192 
Asn Ala Asp Val 



caa get 
Gin Ala 

aaa gat 

Lys Asp 

ggt gea 
Gly Ala 
110 

gaa aae 
Glu Asn 
125 

aag gat 
Lys Asp 

gtg get 
Val Ala 

get att 
Ala He 



tgg ttc 240 
Trp Phe 
80 

tac ttt 288 
Tyr Phe 
95 

age gaa 336 
Ser Glu 

gga ate 384 
Gly He 

cct aaa 432 
Pro Lys 

aaa ttg 480 
Lys Leu 
160 

gca gaa 528 
Ala Glu 
175 

ttt tea 576 
Phe Ser 



aag tac 
Lys Tyr 
190 

ate tgg gaa att aac acc gaa 624 
He Trp Glu He Asn Thr Glu 
205 

tea ttg att gaa aaa eta aaa 672 
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Glu Glu Gly Thr Pro Asp Gin He Ser 

210 215 
gtc ate aag cca tct gcg ctt ttt gta 
Val He Lys Pro Ser Ala Leu Phe Val 
225 230 
cct atg gaa act gtt tct aaa gat agt 
Pro Met Glu Thr Val Ser Lys Asp Ser 
245 

ate ttt aca gat tea att get aaa aaa 
lie Phe Thr Asp Ser He Ala Lys Lys 
260 265 
tat get atg atg aaa tgg aac ctt gac 
Tyr Ala Met Met Lys Trp Asn Leu Asp 

275 280 
gcaaaataag ataacaataa agcacagcga tt 
893 



Ser Leu He Glu Lys Leu Lys 
220 

gag tea agt gtc gat aga cgc 720 
Glu Ser Ser Val Asp Arg Arg 
235 240 
ggt att cct att tat tct gag 768 
Gly lie Pro He Tyr Ser Glu 
250 255 
ggt aaa cct gge gat agt tat 816 
Gly Lys Pro Gly Asp Ser Tyr 
270 

aaa att tct gaa ggt eta 861 
Lys He Ser Glu Gly Leu 
285 



<210> 2 
<211> 287 
20 <212> PRT 

<213> Streptococcus pyogenes 

<400> 2 

Ser Ser Thr Gly Ala Lys Thr Ala Lys Ser Asp Lys Leu Lys Val Val 
25 1 5 10 15 

Ala Thr Asn Ser He He Ala Asp Met Thr Lys Ala He Ala Gly Asp 

20 25 30 

Lys He Asp Leu His Ser He Val Pro He Gly Gin Asp Pro His Glu 
35 40 45 

30 Tyr Glu Pro Leu Pro Glu Asp Ala Glu Lys Thr Ser Asn Ala Asp Val 
50 55 60 
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He Phe Tyr Asn Gly He Asn Leu Glu Asp Gly Gly Gin Ala Trp Phe 
65 70 75 80 

Thr Lys Leu Val Lys Asn Ala Gin Lys Thr Lys Asn Lys Asp Tyr Phe 
85 90 95 

5 Ala Val Ser Asp Gly He Asp Val He Tyr Leu Glu Gly Ala Ser Glu 
100 105 110 

Lys Gly Lys Glu Asp Pro His Ala Trp Leu Asn Leu Glu Asn Gly He 

115 120 125 

He Tyr Ser Lys Asn He Ala Lys Gin Leu He Ala Lys Asp Pro Lys 
10 130 135 140 

Asn Lys Glu Thr Tyr Glu Lys Asn Leu Lys Ala Tyr Val Ala Lys Leu 
145 150 155 160 

Glu Lys Leu Asp Lys Glu Ala Lys Ser Lys Phe Asp Ala He Ala Glu 
165 170 175 

15 Asn Lys Lys Leu He Val Thr Ser Glu Gly Cys Phe Lys Tyr Phe Ser 
180 185 190 

Lys Ala Tyr Gly Val Pro Ser Ala Tyr He Trp Glu He Asn Thr Glu 

195 200 205 

Glu Glu Gly Thr Pro Asp Gin He Ser Ser Leu He Glu Lys Leu Lys 
20 210 215 220 

Val He Lys Pro Ser Ala Leu Phe Val Glu Ser Ser Val Asp Arg Arg 
225 230 235 240 

Pro Met Glu Thr Val Ser Lys Asp Ser Gly He Pro He Tyr Ser Glu 
245 250 255 

25 He Phe Thr Asp Ser He Ala Lys Lys Gly Lys Pro Gly Asp Ser Tyr 
260 265 270 

Tyr Ala Net Met Lys Trp Asn Leu Asp Lys He Ser Glu Gly Leu 
275 280 285 
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<211> 27 
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<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 3 

tagtagcgaa ttcgtcgact ggcgcta 27 

<210> 4 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 



<400> 4 

agcacaactc gagaatcgct gtgcttta 

<210> 5 
<2ll> 17 
<212> PRT 

<213> Streptococcus pyogenes 
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<400> 5 

Lys Gin Leu He Ala Lys Asp Pro Lys Asn Lys Glu Thr Tyr Glu Lys 

15 10 15 

Asn 



<210> 6 
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<211> 19 
<212> PRT 

<213> Streptococcus pyogenes 
5 <400> 6 

Glu He Asn Thr Glu Glu Glu Gly Thr Pro Asp Gin He Ser Ser Leu 

15 10 15 

He Glu Lys 

10 <210> 7 
<211> 16 
<212> PRT 

<213> Streptococcus pyogenes 
15 <400> 7 

Glu Ser Ser Val Asp Arg Arg Pro Met Glu Thr Val Ser Lys Asp Ser 
15 10 15 

<210> 8 
20 <211> 21 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 8 

25 Thr Asp Ser He Ala Lys Lys Gly Lys Pro Gly Asp Ser Tyr Tyr Ala 
15 10 15 

Met Met Lys Trp Asn 
20 

30 <210> 9 
<211> 13 
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<212> PRT 

<213> Streptococcus pyogenes 
<400> 9 

Gin Asp Pro His Glu Tyr Glu Pro Leu Pro Glu Asp Val 
15 10 
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